FILTER MODE ACTIVE

#speech synthesis

Records found: 8

#speech synthesis23/01/2026

Qwen3-TTS: Open Multilingual TTS Suite with Real-Time Latency

Explore Alibaba Cloud's Qwen3-TTS, a multilingual TTS suite with voice control and real-time response.

#speech synthesis09/11/2025

Build an Agentic Voice AI That Understands, Plans, and Speaks Autonomously

'Tutorial shows how to assemble a real-time voice AI agent that transcribes, reasons, plans and speaks using Whisper and SpeechT5.'

READ →

#speech synthesis29/08/2025

Microsoft Unveils MAI-Voice-1 and MAI-1-Preview — New In-House Voice and Language Models

Microsoft AI Lab released MAI-Voice-1 for fast, high-fidelity speech generation and MAI-1-preview, a homegrown foundation language model optimized for conversational tasks and gradual product integration

READ →

#speech synthesis25/08/2025

VibeVoice-1.5B: Microsoft’s Open TTS for 90-Minute, Multi-Speaker Audio

'Microsoft released VibeVoice 1.5B, an open source TTS model that generates up to 90 minutes of expressive audio with up to four speakers and supports cross lingual and singing synthesis.'

READ →

#speech synthesis05/07/2025

Kyutai Unveils Ultra-Low Latency 2B Parameter Streaming Text-to-Speech Model Trained on 2.5M Hours

Kyutai has launched a groundbreaking streaming TTS model with 2 billion parameters, achieving 220ms latency and trained on 2.5 million hours of speech. This open-source model supports multiple users and real-time applications, advancing speech AI technology.

READ →

#speech synthesis21/06/2025

From Robotic Tones to Lifelike Voices: The Remarkable Journey of AI Speech

Discover how AI voices have evolved from robotic tones to natural, human-like speech, transforming fields like accessibility, entertainment, and customer support.

READ →

#speech synthesis06/05/2025

LLaMA-Omni2: China’s Breakthrough in Real-Time Speech-Enabled Large Language Models

Chinese researchers release LLaMA-Omni2, a modular speech language model that enables real-time spoken dialogue with minimal latency and strong performance using compact training data.

READ →

#speech synthesis29/04/2025

VERSA: The Ultimate Toolkit Revolutionizing Speech, Audio, and Music Evaluation

VERSA is a new, versatile evaluation toolkit integrating 65 metrics for speech, audio, and music assessment, offering unprecedented flexibility and standardization in generative audio evaluation.

READ →